Image Demosaicing and Enhancement System 
Field of the Invention 

The present invention relates to digital cameras, and more particularly, to an improved 
method for converting data from a camera sensor to a color image. 

Background of the Invention 

A digital color image usually consists of an array of pixel values representing the intensity of 
the image at each point on a regular grid. Typically, three colors are used to generate the 
image. At each point on the grid the intensity of each of these colors is specified, thereby 
specifying both the intensity and color of the image at that grid point. 

Conventional color photography records the relevant image data by utilizing three 
overlapping color sensing layers having sensitivities in different regions of the spectrum 
(usually red, green, and blue). Digital cameras, in contrast, typically utilize one array of 
sensors in a single "layer". 

When only one sensor array is used to detect color images, only one color may be detected at 
any given sensor location. As a result, these sensors do not produce a color image in the 
traditional sense, but rather a collection of individual color samples, which depend upon the 
assignment of color filters to individual sensors. This assignment is referred to as the color 
filter array (CFA) or the color mosaic pattern. To produce a true color image, with a full set 
of color samples (usually red, green and blue) at each sampling location, a substantial amount 
of computation is required to estimate the missing information, since only a single color was 
originally sensed at each location in the array. This operation is typically referred to as 
"demosaicing". 

To generate the missing information, information from neighboring pixels in the image 
sensor must be used. A number of algorithms have been put forward in an attempt to provide 
the missing information while minimizing artifacts resulting from the estimation process. 
The simplest algorithms interpolate the sensor data from like color sensors to provide the 
missing information. These algorithms treat the red sensors as being independent from the 
green sensors, and so on. To provide a red value at a given location, the values measured by 
the red sensors in the region of that location are interpolated. This approach requires that the 
image be low-pass filtered. Such filtering reduces the image resolution below the pixel 
resolution of the underlying sensor array. This lost resolution cannot be recovered. 

To avoid this loss in resolution, less aggressive optical low-pass filtering is used in some 
higher-end cameras. However, in such systems, the color sensors may no longer be treated as 
independent. For example, Wober, et al. (U.S. Patent 5,475,769) describe a method for 
generating the missing color information by computing a weighted average of the pixel 
values in the neighborhood of the pixel whose missing color information is being computed. 
This method weights values from all of the color sensors, not just the color being 
reconstructed. However, even this approach leaves much to be desired since it utilizes one set 
of weights for all images. 

A single pixel array may be viewed as consisting of a number of separate planes of pixels in 
which each plane has sensors for the same color. Since the pixels do not overlap, the sensors 
in the various planes are at different locations. Systems that take weighted averages across 
more than one plane make use of the statistical dependencies between these sample locations. 
In effect, the blurring of an image by the camera optics allows an image edge that falls on one 
color plane precisely on the sensors of that plane to also be seen in the other color planes 
because the image is spread by blurring onto the sensors in the other color planes. Since the 
statistical dependencies between the various color planes depend on the amount of blur 
introduced by the camera optics, an optimal algorithm must take into account the physical 
camera settings. Accordingly, a single set of weight functions will not provide an optimal 
estimation of the missing information. 

The statistical dependencies also depend on the source of illumination. Different illumination 
sources have different spectra. The pixel filters have broad pass-bands centered at the red, 
green, and blue wavelengths. In the absence of any image blurring, the response of any given 
pixel is determined by its color filter, the reflectivity of the corresponding point in the scene 
being photographed, and the light spectrum incident on that point from the illumination 
source. The blurring provided by the camera optics mixes the light between the pixels. 
Hence, the statistical dependencies, in general, depend both on the illumination source and 
the camera optics. Prior art methods for converting the pixel array data to a fully sampled 
color digital image do not take the illumination source into account. 

Broadly, it is the object of the present invention to provide an improved image processing 
method for converting data from a pixel array having non-overlapping sensors to a fully 
sampled digital image. 



It is a further object of the present invention to provide a conversion method that corrects for 
the camera's optical system. 



It is a still further object of the present invention to provide a conversion method that corrects 
for the source of illumination. 

These and other objects of the present invention will become apparent to those skilled in the 
art from the following detailed description of the invention and the accompanying drawings. 



Summary of the Invention 



The present invention is a method for operating a data processing system to generate a second 
image from a first image. The first image includes a two dimensional array of pixel values, 
each pixel value corresponding to the light intensity in one of a plurality of spectral bands at a 
location in the first image. The method utilizes a linear transformation of a vector derived 
from super input pixels to obtain a vector that includes at least one super output pixel. The 
super input pixels are defined by separating the pixels of the first image into a plurality of 
input image planes. Each input image plane has an identical number of pixels within a 
normalized horizontal and vertical sampling interval as the other input image planes. All 
pixels in a given input image plane correspond to the same spectral band as the other pixels in 
that input image plane. Each super input pixel is a vector of dimension P, where P is the 
number of the input image planes, each component of that vector being an input pixel from a 
corresponding input image plane. Similarly, a set of output image planes is defined, each 
pixel in a given output image plane representing the intensity of the second image in one of a 
plurality of spectral bands at a corresponding point in the second image. Each super output 
pixel is a vector of dimension Q, where Q is the number of the output image planes, each 
component of that vector being a pixel from a corresponding output image plane. In the 
preferred embodiment of the present invention, the linear transformation depends on the 
properties of the optical system and the illumination source used to generate the first image. 
The linear transformation can also be varied to take into account the contents of the scene 
captured in the first image and the desired output format of the second image. 

Brief Description of the Drawings 

Figure 1 illustrates the separation of an image taken with an image sensor having a repeating 
2x2 pattern into image planes according to the present invention. 

Figure 2 illustrates a portion of a sensor array and the input pixels in the sensor array which 
contribute to a particular intermediate input vector. 

Figure 3 illustrates a portion of an output RGB (red, green, blue) image and the pixels in the 
RGB output image that correspond to the intermediate output vector shown in Figure 2. 

Detailed Description of the Invention 

The method of the present invention may be applied to any color-sampling device that 
acquires its samples from a sensor array that can be decomposed into a plurality of image 
planes which satisfy two conditions. First, each image plane must have an identical number 
of samples within a normalized horizontal and vertical sampling interval; however, the 
various image planes may be arbitrarily displaced relative to one another. Second, all 
samples in a given image plane must have identical color properties; however, multiple image 
planes can have the same color properties. 

These conditions are satisfied by any image sensor having a sensor pattern that is constructed 
by repeating a kernel of sensing elements. For example, one common image sensor array is 
based on the Bayer pattern, which is generated by repeating a 2x2 sensor array kernel having 
two green sensors, one red sensor, and one blue sensor. This pattern is shown in Figure 1 at 
10. The kernel is shown at 12. Such an image sensor may be viewed as having 4 planes 
shown at 14-17: two green planes 14 and 17, one red plane 16, and one blue plane 15. The 
sampling interval is the area originally occupied by the kernel. Each of the planes is offset 
with respect to the other planes. It can be shown that any regular sampling lattice can be 
decomposed into a set of image planes satisfying the above conditions. 
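The decomposition into image planes can be sketched in a few lines of Python. This is only an illustration, assuming the mosaic is stored as a list of rows and that the kernel is 2x2; the function names `split_bayer_planes` and `super_pixel`, the ordering of the four planes, and the use of the first index as the row index are hypothetical choices, and no particular color is assigned to any kernel position.

```python
def split_bayer_planes(mosaic):
    """Split a mosaic built from a repeating 2x2 kernel into four image planes.

    mosaic: list of rows of raw sensor values (even height and width).
    Returns a dict mapping each kernel offset (row, col) to its plane.
    """
    height, width = len(mosaic), len(mosaic[0])
    if height % 2 or width % 2:
        raise ValueError("mosaic must contain whole 2x2 kernels")
    planes = {}
    for dr in (0, 1):
        for dc in (0, 1):
            # Every second sample, starting at the kernel offset (dr, dc).
            planes[(dr, dc)] = [
                [mosaic[r][c] for c in range(dc, width, 2)]
                for r in range(dr, height, 2)
            ]
    return planes


def super_pixel(planes, n1, n2):
    """Return the P = 4 component super pixel at kernel index (n1, n2)."""
    order = [(0, 0), (0, 1), (1, 0), (1, 1)]  # assumed plane ordering
    return [planes[offset][n1][n2] for offset in order]


mosaic = [
    [10, 20, 11, 21],
    [30, 40, 31, 41],
    [12, 22, 13, 23],
    [32, 42, 33, 43],
]
planes = split_bayer_planes(mosaic)
first_plane = planes[(0, 0)]
example_pixel = super_pixel(planes, 0, 1)
```

Each plane has one sample per kernel, so all four planes have the same number of samples per sampling interval, which is the first of the two conditions stated above.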



To simplify the following discussion, vector notation will be utilized. Vectors and matrices 
will be shown in bold print to distinguish them from scalar quantities. The measured 
intensity values in each image plane will be denoted by $x_p[n_1, n_2]$. Here, $n_1$ and $n_2$ are 
indices which denote the position of the pixel in the p-th image plane and $x_p$ is the intensity 
value measured for that pixel. The quantity $[n_1, n_2]$ is a two-dimensional integer valued vector 
which will be denoted by $\mathbf{n}$. The entire set of image planes can then be represented as a set of 
vectors $\mathbf{x}[\mathbf{n}]$ where 

$$\mathbf{x}[\mathbf{n}] = \begin{bmatrix} x_1[n_1, n_2] \\ x_2[n_1, n_2] \\ \vdots \\ x_P[n_1, n_2] \end{bmatrix} \qquad (1)$$



The output image can likewise be represented as a set of vectors defined on a different set of 
image planes. Typically, the goal of the demosaicing algorithm is to generate a set of 
regularly spaced pixels in three color planes (red, green, and blue). Denote the intensity in 
the i-th color plane by $y_i[m_1, m_2]$. Then the output pixels can also be represented by a set of 
vectors, 

$$\mathbf{y}[\mathbf{m}] = \begin{bmatrix} y_1[m_1, m_2] \\ y_2[m_1, m_2] \\ \vdots \\ y_Q[m_1, m_2] \end{bmatrix} \qquad (2)$$



In the demosaicing case, Q is typically 3; however, different Q values may be utilized. For 
example, an image that is to be printed on a color printer utilizing 4 dyes could be generated 
directly by the method of the present invention utilizing a representation in which Q=4. 



In general, the output image will have a spatial resolution that is different from the input 
image. The input image may be viewed as consisting of a set of "super pixels", $\mathbf{x}[\mathbf{n}]$. 
Likewise, the output image is a set of pixels $\mathbf{y}[\mathbf{m}]$. The number of output pixels in the 
vertical and horizontal directions corresponding to each input super pixel will be denoted by 
$\lambda_1$ and $\lambda_2$, respectively. In the case of the Bayer pattern discussed above, the demosaicing 
task is usually understood as having $\lambda_1 = \lambda_2 = 2$. That is, one attempts to construct one 
output (RGB) pixel for each physical sensor in the input array. 

In the method of the present invention, the output pixels are related to the input pixels by a 
linear operator that operates on vectors derived from $\mathbf{x}[\mathbf{n}]$ and $\mathbf{y}[\mathbf{m}]$. These intermediate 
vectors take into account the difference in resolution and the fact that each output pixel 
depends on more than one input super pixel. The intermediate vector corresponding to $\mathbf{y}[\mathbf{m}]$ 
will be denoted by $\boldsymbol{\zeta}[\mathbf{n}]$ and has the same sampling density as $\mathbf{x}[\mathbf{n}]$: 

$$\boldsymbol{\zeta}[\mathbf{n}] = \begin{bmatrix} \mathbf{y}[\Lambda(\mathbf{n} + \boldsymbol{\delta}_1)] \\ \mathbf{y}[\Lambda(\mathbf{n} + \boldsymbol{\delta}_2)] \\ \vdots \\ \mathbf{y}[\Lambda(\mathbf{n} + \boldsymbol{\delta}_{\lambda_1 \lambda_2})] \end{bmatrix} \qquad (3)$$

Here, the matrix $\Lambda$ is defined by 

$$\Lambda = \begin{bmatrix} \lambda_1 & 0 \\ 0 & \lambda_2 \end{bmatrix} \qquad (4)$$

In the case of the Bayer pattern, $\boldsymbol{\delta}_1 = [0, 0]$, $\boldsymbol{\delta}_2 = [1/2, 0]$, $\boldsymbol{\delta}_3 = [0, 1/2]$, and 
$\boldsymbol{\delta}_4 = [1/2, 1/2]$. The vectors $\boldsymbol{\zeta}[\mathbf{n}]$ will be referred to as the output polyphase components in 
the following discussion. 
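As a small worked illustration of how the polyphase components index the output grid, the sketch below computes the output positions obtained by scaling a super-pixel index plus each offset by the resolution factors. It assumes the Bayer case with a factor of 2 in each direction and the four half-sample offsets [0,0], [1/2,0], [0,1/2] and [1/2,1/2]; the function name `polyphase_positions` is hypothetical.

```python
from fractions import Fraction


def polyphase_positions(n, lambdas, deltas):
    """Return the output-grid indices lambda_i * (n_i + delta_i) for each offset."""
    positions = []
    for delta in deltas:
        m = tuple(lam * (ni + di) for lam, ni, di in zip(lambdas, n, delta))
        # Exact rational arithmetic lets us check that m is an integer index.
        if any(c.denominator != 1 for c in m):
            raise ValueError("offset does not land on the output grid")
        positions.append(tuple(int(c) for c in m))
    return positions


half = Fraction(1, 2)
bayer_deltas = [(0, 0), (half, 0), (0, half), (half, half)]
positions = polyphase_positions((3, 5), (2, 2), bayer_deltas)
# positions == [(6, 10), (7, 10), (6, 11), (7, 11)]
```

The four positions form one 2x2 block of the output image, so each input super pixel index accounts for exactly one such block.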



In the method of the present invention, it is assumed that each polyphase output vector 
depends on a finite number of input super pixels. In general, the input super pixels that 
contribute to a particular polyphase output vector $\boldsymbol{\zeta}[\mathbf{n}]$ will be located in a neighborhood 
around $[\mathbf{n}]$. As will be explained in more detail below, the precise pixels will depend on the 
nature of the camera and imaging optics. The input super pixels that contribute to the 
polyphase output vector at $\mathbf{n}$ may be identified by a set of displacement vectors 
$\mathbf{k}_1, \mathbf{k}_2, \ldots, \mathbf{k}_K$. That is, $\boldsymbol{\zeta}[\mathbf{n}]$ depends on $\mathbf{x}[\mathbf{n}+\mathbf{k}_1], \mathbf{x}[\mathbf{n}+\mathbf{k}_2], \ldots, \mathbf{x}[\mathbf{n}+\mathbf{k}_K]$. In the 
method of the present invention, $\boldsymbol{\zeta}[\mathbf{n}]$ is assumed to be linearly dependent on the input super 
pixels. In the preferred embodiment of the present invention, the set of displacement vectors 
$\mathbf{k}_1, \mathbf{k}_2, \ldots, \mathbf{k}_K$ is independent of $\mathbf{n}$, and is arranged in a $K_1 \times K_2$ rectangular grid. 

The linear relationship can be most easily defined in terms of a vector $\boldsymbol{\xi}[\mathbf{n}]$ which includes all 
of the super pixels on which the output polyphase vector $\boldsymbol{\zeta}[\mathbf{n}]$ depends, i.e., 

$$\boldsymbol{\xi}[\mathbf{n}] = \begin{bmatrix} \mathbf{x}[\mathbf{n} + \mathbf{k}_1] \\ \vdots \\ \mathbf{x}[\mathbf{n} + \mathbf{k}_K] \end{bmatrix} \qquad (5)$$

In terms of this vector, the relationship between $\boldsymbol{\zeta}[\mathbf{n}]$ and $\boldsymbol{\xi}[\mathbf{n}]$ may be written as matrix 
multiplication: 

$$\boldsymbol{\zeta}[\mathbf{n}] = \mathbf{T}\,\boldsymbol{\xi}[\mathbf{n}] \qquad (6)$$

where $\mathbf{T}$ is a $(Q\lambda_1\lambda_2) \times (PK_1K_2)$ matrix. 
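The matrix multiplication of Eq. (6) can be illustrated with a short Python sketch. It assumes the super pixels are held in a nested list indexed as `super_pixels[n1][n2]`, each entry being a list of P values; the helper names `neighborhood_vector` and `apply_transform` are hypothetical, and image borders are not handled.

```python
def neighborhood_vector(super_pixels, n, displacements):
    """Stack the super pixels at n + k, for every displacement k, into one vector."""
    xi = []
    for k1, k2 in displacements:
        xi.extend(super_pixels[n[0] + k1][n[1] + k2])
    return xi


def apply_transform(T, xi):
    """Compute the polyphase output vector as the matrix-vector product T xi."""
    if any(len(row) != len(xi) for row in T):
        raise ValueError("T must have one column per component of xi")
    return [sum(t * v for t, v in zip(row, xi)) for row in T]


# Toy example with P = 2 planes and a 1 x 2 grid of displacements.
grid = [
    [[1, 2], [3, 4]],
    [[5, 6], [7, 8]],
]
xi = neighborhood_vector(grid, (0, 0), [(0, 0), (0, 1)])
T = [
    [0.5, 0.0, 0.5, 0.0],  # averages the first plane over the two neighbors
    [0.0, 1.0, 0.0, 0.0],  # copies the second plane of the center super pixel
]
zeta = apply_transform(T, xi)
```

In a full demosaicing pass the same T would be applied at every super-pixel index n, since the displacement set is independent of n in the preferred embodiment.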



Refer now to Figures 2 and 3 which illustrate the relationships between the output pixels 
$\mathbf{y}[N, M]$, the input pixels $\mathbf{x}[N, M]$ and the two intermediate vectors defined above for the 
Bayer sensor pattern. Figure 2 illustrates a portion of a sensor array and the input pixels in 
the sensor array which contribute to $\boldsymbol{\xi}[N, M]$. Figure 3 illustrates a portion of an output RGB 
image and the pixels in the RGB output image that correspond to $\boldsymbol{\zeta}[N, M]$ and which are 
computed from the pixels shown in Figure 2 by the matrix multiplication operation shown in 
Eq. (6). 

The matrix, T, depends on a number of factors. Some of these are fixed for a particular 
imaging device and some depend on the particular manner in which the imaging device is 
being utilized. For example, the physical properties of the sensing array such as the spectral 
sensitivity of the pixels, the mosaic pattern, and the number of pixels typically do not vary 
from image to image. In contrast, the optical properties of the imaging device such as the 
lens settings on a camera (f number and zoom) may vary from image to image. In addition, 
the spectral properties of the illumination source may vary from image to image (daylight, 
flash, incandescent light, etc.). 
In addition, the statistics of the image being captured may be taken into account through T. 
For example, images or portions of images having a high content of vertical and horizontal 
edges can be processed with a different matrix than images lacking such features, thereby 
improving the output image quality. 

In cameras that have a variable output format, the resolution of the final picture can be set 
using a different T matrix. Alternatively, a single T matrix may be utilized for all resolutions 
and then the desired output image determined by re-sampling the fixed resolution image. 
Similarly, the number of output color planes may be altered by using different T matrices or 
by re-sampling a single color format to generate an alternate color representation. In general, 
in low cost imaging devices, properties that alter the dimension of the T matrix are preferably 
handled by using a fixed T matrix and then re-sampling the final image. 

If the number of different T matrices is relatively small, the coefficients of the T matrix can 
be determined by training the system on known images. For each possible T matrix, images 
of a number of known scenes are taken using the imaging device. The coefficients of the T 
matrix are then computed so as to minimize the difference between the image computed from 
the sensor input and the known scene images. Such optimization computations are well 
known to those skilled in the art, and hence, will not be discussed in detail here. 
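The text does not say which optimization is used for training. One common choice, shown here purely as an assumption, is ordinary least squares: given training pairs of input neighborhood vectors and known output vectors, choose T to minimize the summed squared error. The sketch below solves the normal equations with a small Gauss-Jordan routine; the names `solve` and `fit_transform` are hypothetical.

```python
def solve(A, B):
    """Solve A Y = B by Gauss-Jordan elimination with partial pivoting."""
    n = len(A)
    aug = [list(map(float, A[i])) + list(map(float, B[i])) for i in range(n)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(aug[r][col]))
        if abs(aug[pivot][col]) < 1e-12:
            raise ValueError("training data do not determine T uniquely")
        aug[col], aug[pivot] = aug[pivot], aug[col]
        scale = aug[col][col]
        aug[col] = [v / scale for v in aug[col]]
        for r in range(n):
            if r != col and aug[r][col] != 0.0:
                f = aug[r][col]
                aug[r] = [vr - f * vc for vr, vc in zip(aug[r], aug[col])]
    return [row[n:] for row in aug]


def fit_transform(inputs, targets):
    """Least-squares T minimizing sum ||target - T input||^2 over training pairs."""
    d, q = len(inputs[0]), len(targets[0])
    # A = sum of input * input^T (d x d); Bt = sum of input * target^T (d x q).
    A = [[sum(x[i] * x[j] for x in inputs) for j in range(d)] for i in range(d)]
    Bt = [[sum(x[i] * y[r] for x, y in zip(inputs, targets)) for r in range(q)]
          for i in range(d)]
    Y = solve(A, Bt)  # Y equals the transpose of T because A is symmetric
    return [[Y[i][r] for i in range(d)] for r in range(q)]


# Synthetic check: recover a known 3 x 2 matrix from noise-free pairs.
true_T = [[1.0, 2.0], [0.0, -1.0], [3.0, 0.0]]
inputs = [(1.0, 0.0), (0.0, 1.0), (1.0, 1.0), (2.0, -1.0)]
targets = [[sum(t * v for t, v in zip(row, x)) for row in true_T] for x in inputs]
fitted = fit_transform(inputs, targets)
```

With real training images the pairs would be noisy and far more numerous, and a regularized or numerically more robust solver may be preferable.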

If the variation in some parameter such as f-number is relatively smooth, the T matrices need 
only be computed for a discrete number of values of the parameter. The correct T matrix for 
the non-computed variable parameter values can then be computed by interpolation of the 
computed T matrices. 
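A minimal sketch of such interpolation follows. The text does not specify the interpolation scheme; element-wise linear interpolation between the two nearest calibrated f-numbers is assumed here, and the function name `interpolate_T` and the sample values are hypothetical.

```python
def interpolate_T(f_number, calibrated):
    """Linearly interpolate T element-wise between the nearest calibrated f-numbers.

    calibrated: list of (f_number, T) pairs sorted by increasing f_number,
    where every T has the same dimensions.
    """
    if f_number <= calibrated[0][0]:
        return calibrated[0][1]
    if f_number >= calibrated[-1][0]:
        return calibrated[-1][1]
    for (f0, T0), (f1, T1) in zip(calibrated, calibrated[1:]):
        if f0 <= f_number <= f1:
            w = (f_number - f0) / (f1 - f0)
            return [[(1 - w) * a + w * b for a, b in zip(r0, r1)]
                    for r0, r1 in zip(T0, T1)]


calibrated = [
    (2.0, [[0.0, 1.0]]),
    (4.0, [[1.0, 3.0]]),
]
T_mid = interpolate_T(3.0, calibrated)  # halfway between the two matrices
```

Values outside the calibrated range are clamped to the nearest calibrated matrix rather than extrapolated.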
Model-Based Computation of T 

As noted previously, in some circumstances it may be possible to compute appropriate 
matrices, T, from training images. Unfortunately, this approach is limited to applications 
in which the number of different imaging conditions and hence the number 
of different T matrices which could be required is relatively small. The purpose of 
the material presented here is to describe a method for directly computing T for an 
arbitrary imaging device (i.e. arbitrary color sensitivities, sensor locations and optical 
characteristics) and under arbitrary illumination, subject to a particular statistical 
model for the underlying image, which has been found to give particularly good 
reconstructed image quality. As will be seen, the statistical image model is governed 
by only a few parameters. In more advanced applications, these parameters may be 
adjusted, either locally or globally, to match statistical properties of the image, such 
as edge orientation, which can be estimated by various methods. 

Image Formation Model 

This section describes the parameters of the image formation process which maps the 
original scene into the source image super-pixels, x[n]. The image formation model 
depends upon deterministic quantities which can, at least in theory, be measured. 
These quantities are 

• The scene illuminant spectral power density, $l(\lambda)$. 

• The color spectral response functions, $r_p(\lambda)$, for each input image plane, p. 

• The Point Spread Function, $h_p(\lambda, \mathbf{s})$, associated with the combined effects of the 
optical transfer function and sensor integration behaviour for input image plane 
p. Here $\mathbf{s} = [s_1, s_2]$ is the spatially continuous argument of the Point Spread 
Function (PSF), at each wavelength, $\lambda$. Note that the PSFs implicitly include 
the effect of relative displacements between the different input image planes. 
In the following, the PSF is referenced only through its Fourier Transform, 
$\hat{h}_p(\lambda, \boldsymbol{\omega})$, where the spatial frequency vector, $\boldsymbol{\omega} = [\omega_1, \omega_2]$, is normalized so that 
$\omega_1 = \omega_2 = \pi$ at the Nyquist frequency of the input super-pixels. Thus, the 
Nyquist frequency of the original sensor array corresponds to $\boldsymbol{\omega} = [\lambda_1\pi, \lambda_2\pi]$. 

Rather than modeling the image formation process directly in terms of the desired output 
image super-pixels, it is helpful to choose an intermediate representation in terms 
of surface spectral-reflectance, since this is well known to be better behaved from a 
statistical perspective than the scene radiance itself and hence more amenable to the 
statistical modelling described in the next section. Specifically it is helpful to assume 
that the spectral reflectance of the original scene can be perfectly represented as a linear 
combination of a limited number of fixed basis functions, $b_1(\lambda), b_2(\lambda), \ldots, b_S(\lambda)$, 
where S is usually chosen to be three or four, but may be larger if desired. The 
actual output vectors, $\mathbf{y}[\mathbf{m}]$, may be expressed in terms of the intermediate spectral 
reflectance vectors, $\mathbf{z}[\mathbf{m}]$, as 

$$\mathbf{y}[\mathbf{m}] = \mathbf{T}_{out}\,\mathbf{z}[\mathbf{m}] = \begin{pmatrix} \int_0^\infty d_1(\lambda) l(\lambda) b_1(\lambda)\,d\lambda & \int_0^\infty d_1(\lambda) l(\lambda) b_2(\lambda)\,d\lambda & \cdots & \int_0^\infty d_1(\lambda) l(\lambda) b_S(\lambda)\,d\lambda \\ \int_0^\infty d_2(\lambda) l(\lambda) b_1(\lambda)\,d\lambda & \int_0^\infty d_2(\lambda) l(\lambda) b_2(\lambda)\,d\lambda & \cdots & \int_0^\infty d_2(\lambda) l(\lambda) b_S(\lambda)\,d\lambda \\ \vdots & \vdots & \ddots & \vdots \\ \int_0^\infty d_Q(\lambda) l(\lambda) b_1(\lambda)\,d\lambda & \int_0^\infty d_Q(\lambda) l(\lambda) b_2(\lambda)\,d\lambda & \cdots & \int_0^\infty d_Q(\lambda) l(\lambda) b_S(\lambda)\,d\lambda \end{pmatrix} \mathbf{z}[\mathbf{m}]$$

where $d_q(\lambda)$ is the q-th display spectral response function. For 
example, if the objective is to recover an XYZ image, then Q should be set to 3 and 
$d_1(\lambda)$ through $d_3(\lambda)$ should be set to the standard 1931 CIE tri-stimulus functions. 
As another example, if the objective is to recover an image with the same color 
characteristics as the different color filters on the physical sensor array, then Q should 
be set to the number of unique input response functions, $r_p(\lambda)$, and there should 
be a one-to-one correspondence between these unique $r_p(\lambda)$ and the $d_q(\lambda)$. In this 
framework, the chief goal is to compute the $(S\lambda_1\lambda_2) \times (PK_1K_2)$ reconstruction matrix, 
$\mathbf{T}_{ref}$, which maps the neighbourhood of input super-pixels, $\boldsymbol{\xi}[\mathbf{n}]$, to the corresponding 
spectral-reflectance super-pixel, 

$$\boldsymbol{\zeta}'[\mathbf{n}] = \begin{pmatrix} \mathbf{z}[\Lambda(\mathbf{n} + \boldsymbol{\delta}_1)] \\ \mathbf{z}[\Lambda(\mathbf{n} + \boldsymbol{\delta}_2)] \\ \vdots \\ \mathbf{z}[\Lambda(\mathbf{n} + \boldsymbol{\delta}_{\lambda_1\lambda_2})] \end{pmatrix}$$

The final $(Q\lambda_1\lambda_2) \times (PK_1K_2)$ reconstruction matrix is then formed by simple matrix 
multiplication: 

$$\mathbf{T} = \begin{pmatrix} \mathbf{T}_{out} & 0 & \cdots & 0 \\ 0 & \mathbf{T}_{out} & \cdots & 0 \\ \vdots & \vdots & \ddots & \vdots \\ 0 & 0 & \cdots & \mathbf{T}_{out} \end{pmatrix} \mathbf{T}_{ref} \qquad (7)$$
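The composition of the final reconstruction matrix from a block-diagonal arrangement of the output matrix and the reflectance reconstruction matrix can be sketched as follows. This is an illustration with plain nested lists; the helper names `block_diagonal`, `matmul` and `compose_T`, and the toy dimensions, are hypothetical.

```python
def block_diagonal(block, copies):
    """Place `copies` copies of `block` along the diagonal of a zero matrix."""
    rows, cols = len(block), len(block[0])
    out = [[0.0] * (cols * copies) for _ in range(rows * copies)]
    for c in range(copies):
        for i in range(rows):
            for j in range(cols):
                out[c * rows + i][c * cols + j] = block[i][j]
    return out


def matmul(A, B):
    """Multiply two matrices stored as lists of rows."""
    if len(A[0]) != len(B):
        raise ValueError("inner dimensions do not match")
    return [[sum(a * b for a, b in zip(row, col)) for col in zip(*B)] for row in A]


def compose_T(T_out, T_ref, n_phases):
    """Form T by applying T_out to each of the n_phases blocks of T_ref's output."""
    return matmul(block_diagonal(T_out, n_phases), T_ref)


# Toy sizes: Q = 1 output plane, S = 2 basis functions, 2 polyphase components,
# and a neighborhood vector with 3 components.
T_out = [[1.0, 1.0]]
T_ref = [
    [1, 0, 0],
    [0, 1, 0],
    [0, 0, 1],
    [1, 1, 1],
]
T = compose_T(T_out, T_ref, 2)
```

The number of diagonal copies equals the number of output polyphase components, so each reflectance vector in the super-pixel is converted to output colors independently.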



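The linear image formation model may now be expressed compactly as 

$$\hat{\mathbf{x}}(\boldsymbol{\omega}) = \hat{\boldsymbol{\nu}}(\boldsymbol{\omega}) + \sum_{\boldsymbol{\omega}_a \in \Omega_a(\boldsymbol{\omega})} \mathbf{H}(\boldsymbol{\omega}_a)\,\hat{\mathbf{z}}(\Lambda^{-1}\boldsymbol{\omega}_a)$$

where 

• $\hat{\mathbf{x}}(\boldsymbol{\omega})$ is the Discrete Space Fourier Transform of the input image, $\mathbf{x}[\mathbf{n}]$; 

• $\hat{\boldsymbol{\nu}}(\boldsymbol{\omega})$ is the Discrete Space Fourier Transform of the sampling noise vector 
sequence; 

• $\hat{\mathbf{z}}(\boldsymbol{\omega})$ is the Discrete Space Fourier Transform of the spectral reflectance vector, 
$\mathbf{z}[\mathbf{m}]$; 

• $\Omega_a(\boldsymbol{\omega})$ is the set containing all $\lambda_1\lambda_2$ aliasing frequencies associated with the 
sampling of the high resolution output image on grid $[\mathbf{m}]$ onto the input super-pixel 
grid $[\mathbf{n}]$, for each $\boldsymbol{\omega} \in [-\pi, \pi]^2$; and 

• $\mathbf{H}(\boldsymbol{\omega})$ is the $P \times S$ image formation matrix, 

$$\mathbf{H}(\boldsymbol{\omega}) = \begin{pmatrix} \int_0^\infty r_1(\lambda) l(\lambda) b_1(\lambda) \hat{h}_1(\lambda, \boldsymbol{\omega})\,d\lambda & \cdots & \int_0^\infty r_1(\lambda) l(\lambda) b_S(\lambda) \hat{h}_1(\lambda, \boldsymbol{\omega})\,d\lambda \\ \int_0^\infty r_2(\lambda) l(\lambda) b_1(\lambda) \hat{h}_2(\lambda, \boldsymbol{\omega})\,d\lambda & \cdots & \int_0^\infty r_2(\lambda) l(\lambda) b_S(\lambda) \hat{h}_2(\lambda, \boldsymbol{\omega})\,d\lambda \\ \vdots & \ddots & \vdots \\ \int_0^\infty r_P(\lambda) l(\lambda) b_1(\lambda) \hat{h}_P(\lambda, \boldsymbol{\omega})\,d\lambda & \cdots & \int_0^\infty r_P(\lambda) l(\lambda) b_S(\lambda) \hat{h}_P(\lambda, \boldsymbol{\omega})\,d\lambda \end{pmatrix}$$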


Statistical Model 



In order to compute an appropriate solution to the image reconstruction problem, 
it is necessary to introduce a statistical model for the sampling noise, $\hat{\boldsymbol{\nu}}(\boldsymbol{\omega})$, and the 
spectral reflectance, $\hat{\mathbf{z}}(\boldsymbol{\omega})$. In this discussion, Wide Sense Stationary Gaussian models 
are assumed, which are characterized entirely by the covariance matrices, 

$$\mathbf{C}_\nu(\boldsymbol{\omega}) = E\left[\hat{\boldsymbol{\nu}}(\boldsymbol{\omega}) \cdot \hat{\boldsymbol{\nu}}(\boldsymbol{\omega})^*\right]$$

and 

$$\mathbf{C}_z(\boldsymbol{\omega}) = E\left[\hat{\mathbf{z}}(\boldsymbol{\omega}) \cdot \hat{\mathbf{z}}(\boldsymbol{\omega})^*\right]$$

The noise covariance matrix will usually be a constant, $\mathbf{C}_\nu(\boldsymbol{\omega}) = \sigma^2 \mathbf{I}$, for all $\boldsymbol{\omega}$, 
corresponding to white noise, but other models may be used if appropriate. 

The following parametric model is used for the reflectance covariance matrix, 

$$\mathbf{C}_z(\boldsymbol{\omega}) = \mathbf{C}_z^0 \cdot \|\Gamma\boldsymbol{\omega}\|^{-\rho/10} \qquad (8)$$

where $\mathbf{C}_z^0$ is a constant $S \times S$ covariance matrix, $\rho$ is a frequency roll-off parameter, 
which is usually selected in the range 20 to 30 dB/decade, and $\Gamma$ is a $2 \times 2$ "shape 
matrix". The terms in the above expression which follow the constant covariance 
matrix, $\mathbf{C}_z^0$, describe a scalar envelope function whose contours are ellipses in the 
frequency domain. The orientation and aspect ratio of these elliptic contours may 
be explicitly controlled by means of the $\Gamma$ matrix. For a circular cross-section, the 
identity matrix, $\Gamma = \mathbf{I}$, may be used. 
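The scalar envelope of the reflectance covariance model can be evaluated with a few lines of Python. The sketch assumes the envelope is the norm of the shape matrix times the frequency vector, raised to the power of minus the roll-off divided by 10, so that a roll-off of 20 dB/decade corresponds to a hundredfold drop in power for each tenfold increase in frequency; that exponent, the function name `reflectance_envelope`, and the sample shape matrices are assumptions made for illustration.

```python
import math


def reflectance_envelope(omega, gamma, rho_db_per_decade):
    """Scalar envelope ||gamma * omega|| ** (-rho / 10) of the covariance model.

    omega: frequency vector (w1, w2); gamma: 2 x 2 shape matrix as nested lists.
    """
    g1 = gamma[0][0] * omega[0] + gamma[0][1] * omega[1]
    g2 = gamma[1][0] * omega[0] + gamma[1][1] * omega[1]
    norm = math.hypot(g1, g2)
    if norm == 0.0:
        raise ValueError("the envelope is unbounded at zero frequency")
    return norm ** (-rho_db_per_decade / 10.0)


identity = [[1.0, 0.0], [0.0, 1.0]]
low = reflectance_envelope((1.0, 0.0), identity, 20.0)
high = reflectance_envelope((10.0, 0.0), identity, 20.0)
drop_db = 10.0 * math.log10(high / low)  # about -20 dB over one decade

# A non-identity shape matrix stretches the contours into ellipses.
stretched = [[2.0, 0.0], [0.0, 1.0]]
along_w1 = reflectance_envelope((1.0, 0.0), stretched, 20.0)
along_w2 = reflectance_envelope((0.0, 1.0), stretched, 20.0)
```

With the identity shape matrix the envelope depends only on the magnitude of the frequency vector, which gives the circular cross-section mentioned in the text.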

The statistical model represented by $\mathbf{C}_z$ plays an extremely important role in determining 
the quality of the final reconstructed images. The parametric model described 
above may be justified on a number of grounds; most notably, the model is scale-invariant, 
which means that on average the statistics of scenes should not depend 
upon how far the camera is located from the scene. This scale-invariance property is 
important because in practical imaging applications, information about the absolute 
scale of objects in the scene is rarely available. Also, there is significant empirical 
evidence for this scale-invariance property in natural scenes, with a frequency roll-off 
factor, $\rho$, of about 20 dB/decade. 

Efficient Computation of T 

As mentioned above, the key objective is to compute the $(S\lambda_1\lambda_2) \times (PK_1K_2)$ matrix, 
$\mathbf{T}_{ref}$, from which T is easily recovered via equation (7). The ensuing discussion 
concerns the derivation of an optimal Linear Minimum Mean Squared Error (LMMSE) 
estimator, $\mathbf{T}_{ref}$, subject to the models described in the previous two sections. The 
formula for such an estimator is well-known. Specifically, 

$$\mathbf{T}_{ref} = \mathbf{Z} \cdot \mathbf{X}^{-1} \qquad (9)$$

where $\mathbf{Z}$ is the $(S\lambda_1\lambda_2) \times (PK_1K_2)$ cross-covariance matrix, 

$$\mathbf{Z} = E\left[\boldsymbol{\zeta}'[\mathbf{n}] \cdot \boldsymbol{\xi}[\mathbf{n}]^t\right]$$

and $\mathbf{X}$ is the $(PK_1K_2) \times (PK_1K_2)$ dimensional auto-covariance matrix, 

$$\mathbf{X} = E\left[\boldsymbol{\xi}[\mathbf{n}] \cdot \boldsymbol{\xi}[\mathbf{n}]^t\right]$$
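In fact, $\mathbf{X}$ has the following Toeplitz block-Toeplitz structure, 

$$\mathbf{X} = \begin{pmatrix} \mathbf{X}[0] & \mathbf{X}[1] & \cdots & \mathbf{X}[K_2 - 1] \\ \mathbf{X}[-1] & \mathbf{X}[0] & \cdots & \mathbf{X}[K_2 - 2] \\ \vdots & \vdots & \ddots & \vdots \\ \mathbf{X}[1 - K_2] & \mathbf{X}[2 - K_2] & \cdots & \mathbf{X}[0] \end{pmatrix}$$

where each block, $\mathbf{X}[l_2]$, has the Toeplitz form 

$$\mathbf{X}[l_2] = \begin{pmatrix} \mathbf{X}[0, l_2] & \mathbf{X}[1, l_2] & \cdots & \mathbf{X}[K_1 - 1, l_2] \\ \mathbf{X}[-1, l_2] & \mathbf{X}[0, l_2] & \cdots & \mathbf{X}[K_1 - 2, l_2] \\ \vdots & \vdots & \ddots & \vdots \\ \mathbf{X}[1 - K_1, l_2] & \mathbf{X}[2 - K_1, l_2] & \cdots & \mathbf{X}[0, l_2] \end{pmatrix}$$

and each sub-block, $\mathbf{X}[l_1, l_2]$, is a $P \times P$ source super-pixel covariance matrix, given 
by 

$$\mathbf{X}[\mathbf{l}] = E\left[\mathbf{x}[\mathbf{n}] \cdot \mathbf{x}[\mathbf{n} + \mathbf{l}]^t\right]$$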
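The doubly nested Toeplitz layout means that the full auto-covariance matrix is determined entirely by the P x P sub-blocks indexed by lag. A minimal sketch of the assembly follows; it assumes the first lag index varies fastest inside each outer block, and the function name `assemble_autocovariance` and the toy sub-block function are hypothetical.

```python
def assemble_autocovariance(sub_block, K1, K2, P):
    """Assemble the (P*K1*K2)-square matrix X from P x P sub-blocks.

    sub_block(l1, l2) returns the P x P matrix for lag (l1, l2).
    Block row (i1, i2) and block column (j1, j2) hold the sub-block for
    lag (j1 - i1, j2 - i2), which yields the Toeplitz block-Toeplitz layout.
    """
    size = P * K1 * K2
    X = [[0.0] * size for _ in range(size)]
    for i2 in range(K2):
        for i1 in range(K1):
            for j2 in range(K2):
                for j1 in range(K1):
                    block = sub_block(j1 - i1, j2 - i2)
                    r0 = (i2 * K1 + i1) * P
                    c0 = (j2 * K1 + j1) * P
                    for a in range(P):
                        for b in range(P):
                            X[r0 + a][c0 + b] = block[a][b]
    return X


# Toy sub-blocks (P = 1) whose single entry encodes the lag, to expose the layout.
X = assemble_autocovariance(lambda l1, l2: [[10 * l2 + l1]], K1=2, K2=2, P=1)
```

Only the sub-blocks for lags strictly between -K1 and K1 in the first index and between -K2 and K2 in the second index are ever requested, which is why it suffices to compute those.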

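The $(S\lambda_1\lambda_2) \times (PK_1K_2)$ matrix, $\mathbf{Z}$, also has a doubly-nested block structure. 
Specifically, 

$$\mathbf{Z} = \begin{pmatrix} \mathbf{Z}\left[-\lfloor K_2/2 \rfloor\right] & \cdots & \mathbf{Z}[0] & \cdots & \mathbf{Z}\left[\lfloor (K_2 - 1)/2 \rfloor\right] \end{pmatrix}$$

where 

$$\mathbf{Z}[l_2] = \begin{pmatrix} \mathbf{Z}\left[-\lfloor K_1/2 \rfloor, l_2\right] & \cdots & \mathbf{Z}[0, l_2] & \cdots & \mathbf{Z}\left[\lfloor (K_1 - 1)/2 \rfloor, l_2\right] \end{pmatrix}$$

and the $(S\lambda_1\lambda_2) \times P$ sub-block matrices, $\mathbf{Z}[\mathbf{l}]$, are given by 

$$\mathbf{Z}[\mathbf{l}] = E\left[\boldsymbol{\zeta}'[\mathbf{n}] \cdot \mathbf{x}[\mathbf{n} + \mathbf{l}]^t\right]$$

In order to compute $\mathbf{T}_{ref}$, then, it is sufficient to compute the matrices $\mathbf{X}[\mathbf{l}]$ for 
$[\mathbf{l}] = [l_1, l_2]$ in the range $-K_i < l_i < K_i$ and the matrices $\mathbf{Z}[\mathbf{l}]$, for $[\mathbf{l}] = [l_1, l_2]$ in the 
range $-\lfloor K_i/2 \rfloor \le l_i \le \lfloor (K_i - 1)/2 \rfloor$, after which the contents of $\mathbf{X}$ and $\mathbf{Z}$ may be filled in 
and used to evaluate equation (9). The key to efficient computation of $\mathbf{T}_{ref}$ lies in 
efficiently computing the matrices, $\mathbf{X}[\mathbf{l}]$ and $\mathbf{Z}[\mathbf{l}]$. 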

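It turns out that these matrices may be efficiently computed by exploiting Parseval's 
Relationship. Specifically, 

$$\mathbf{X}[-\mathbf{l}] = \frac{1}{(2\pi)^2} \int_{-\pi}^{\pi} d\omega_1 \int_{-\pi}^{\pi} d\omega_2 \; e^{j\boldsymbol{\omega}^t \mathbf{l}}\, \mathbf{C}_x(\boldsymbol{\omega}) \qquad (10)$$

and 

$$\mathbf{Z}[-\mathbf{l}] = \frac{1}{(2\pi)^2} \int_{-\pi}^{\pi} d\omega_1 \int_{-\pi}^{\pi} d\omega_2 \; e^{j\boldsymbol{\omega}^t \mathbf{l}}\, \mathbf{C}_{\zeta' x}(\boldsymbol{\omega}) \qquad (11)$$

where the frequency auto- and cross-covariance matrices, $\mathbf{C}_x(\boldsymbol{\omega})$ and $\mathbf{C}_{\zeta' x}(\boldsymbol{\omega})$, are 
found from 

$$\mathbf{C}_x(\boldsymbol{\omega}) = \mathbf{C}_\nu(\boldsymbol{\omega}) + \sum_{\boldsymbol{\omega}_a \in \Omega_a(\boldsymbol{\omega})} \mathbf{H}(\boldsymbol{\omega}_a)\, \mathbf{C}_z(\boldsymbol{\omega}_a)\, \mathbf{H}(\boldsymbol{\omega}_a)^* \qquad (12)$$

and 

$$\mathbf{C}_{\zeta' x}(\boldsymbol{\omega}) = \sum_{\boldsymbol{\omega}_a \in \Omega_a(\boldsymbol{\omega})} \Phi(\boldsymbol{\omega}_a)\, \mathbf{C}_z(\boldsymbol{\omega}_a)\, \mathbf{H}(\boldsymbol{\omega}_a)^* \qquad (13)$$

Here, $\Phi(\boldsymbol{\omega})$ is the $(S\lambda_1\lambda_2) \times S$ matrix of phase shifts corresponding to the relative 
displacements of each of the output polyphase components. 

In order to compute the matrices, $\mathbf{X}[\mathbf{l}]$ and $\mathbf{Z}[\mathbf{l}]$, $\mathbf{C}_{\zeta' x}(\boldsymbol{\omega})$ and $\mathbf{C}_x(\boldsymbol{\omega})$ are evaluated at 
a finite number of frequencies, $\boldsymbol{\omega} \in [-\pi, \pi]^2$, and then the Inverse Fourier Transform 
(IFT) integrals of equations (10) and (11) are approximated numerically. There are 
various approaches to determining the best set of frequency points at which to evaluate 
$\mathbf{X}[\mathbf{l}]$ and $\mathbf{Z}[\mathbf{l}]$ and interpolating between these points during the numerical integration 
procedure, but these are beyond the scope of this brief discussion. 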
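One simple way to approximate such an inverse transform integral, shown here only as an assumption since the text leaves the choice of frequency points open, is a midpoint-rule sum on a uniform grid over the square from minus pi to pi in each frequency. The sketch treats a scalar spectrum; for matrix-valued spectra the same sum would be applied to each matrix entry. The function name `inverse_transform` and the sample spectra are hypothetical.

```python
import cmath
import math


def inverse_transform(spectrum, lag, samples=32):
    """Approximate (1 / (2*pi)**2) * integral of exp(j * w.lag) * spectrum(w)
    over [-pi, pi]^2 using the midpoint rule on a samples x samples grid."""
    step = 2.0 * math.pi / samples
    total = 0.0 + 0.0j
    for a in range(samples):
        w1 = -math.pi + (a + 0.5) * step
        for b in range(samples):
            w2 = -math.pi + (b + 0.5) * step
            phase = cmath.exp(1j * (w1 * lag[0] + w2 * lag[1]))
            total += phase * spectrum((w1, w2))
    return total * step * step / (2.0 * math.pi) ** 2


# White spectrum: all the covariance sits at lag zero.
white_zero_lag = inverse_transform(lambda w: 4.0, (0, 0))
white_unit_lag = inverse_transform(lambda w: 4.0, (1, 0))

# Spectrum 1 + cos(w1): the unit lag along the first axis carries weight 1/2.
cosine_unit_lag = inverse_transform(lambda w: 1.0 + math.cos(w[0]), (1, 0))
```

For spectra that are not smooth, such as the power-law envelope near zero frequency, a uniform grid converges slowly, which is one reason a more careful choice of frequency points may be worthwhile.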
Various modifications to the present invention will become apparent to those skilled 
in the art from the foregoing description and accompanying drawings. Accordingly, the 
present invention is to be limited solely by the scope of the following claims. 



WHAT IS CLAIMED IS: 



1. A method for operating a data processing system to generate a second 
image from a first image, said first image comprising a two dimensional array of pixel 
values, each of said pixel values corresponding to the light intensity in one of a 
plurality of spectral bands at a location in said first image, said method comprising the 
steps of: 

separating said pixels of said first image into a plurality of input image planes, 
each input image plane having an identical number of pixels within a normalized 
horizontal and vertical sampling interval as the other input image planes, and all 
pixels in a given input image plane having the same spectral band as the other pixels 
in that input image plane; 

representing said first image as a set of super input pixels, each of said super 
input pixels being a vector of dimension P, where P is the number of said input image 
planes, each component of that vector being an input pixel from a corresponding input 
image plane; 

defining a set of output image planes, each pixel in a given output image plane 
representing the intensity of said second image in one of a plurality of spectral bands 
at a corresponding point in said second image; 

representing said second image as a set of super output pixels, each super 
output pixel being a vector of dimension Q, where Q is the number of said output 
image planes, each component of that vector being a pixel from a corresponding 
output image plane; and 

applying a linear transformation to a vector derived from said super input 
pixels to obtain a vector comprising at least one of said super output pixels. 

2. The method of Claim 1 wherein said first image is generated by an optical 
device having a lens system for imaging a scene onto an array of photosensitive 
detectors, and wherein said linear transformation depends on a property of said lens 
system. 

3. The method of Claim 2 wherein said property is the focal length of said 
lens system. 

4. The method of Claim 2 wherein said property is the f-number of said lens 
system. 

5. The method of Claim 1 wherein said linear transformation depends on the 
source of illumination used to generate said first image. 

6. The method of Claim 1 wherein said linear transformation depends on the 
type of scene captured in said first image. 

7. The method of Claim 1 wherein said linear transformation depends on the 
output format of said second image. 
ABSTRACT 

A method for operating a data processing system to generate a second image 
from a first image. The first image includes a two dimensional array of pixel values, 
each pixel value corresponding to the light intensity in one of a plurality of spectral 
bands at a location in the first image. The method utilizes a linear transformation of a 
vector derived from super input pixels to obtain a vector that includes at least one 
super output pixel. The super input pixels are defined by separating the pixels of the 
first image into a plurality of input image planes having identical numbers of pixels 
corresponding to the same spectral band. Each super input pixel is a vector of 
dimension P, where P is the number of the input image planes. Similarly, a set of 
output image planes is defined, each pixel in a given output image plane representing 
the intensity of the second image in one of a plurality of spectral bands at a 
corresponding point in the second image. Each super output pixel is a vector of 
dimension Q, where Q is the number of the output image planes, each component of 
that vector being a pixel from a corresponding output image plane. In the preferred 
embodiment of the present invention, the linear transformation depends on the 
properties of the optical system and the illumination source used to generate the first 
image. 